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METHOD AND APPARATUS FOR TRANSLATING A CLASSIFICATION SYSTEM 

INTO A TARGET LANGUAGE 

Field of the Invention 

5 The present invention relates generally to methods and systems that classify 

spoken utterances or text into one of several subject areas, and more particularly, to methods and 
apparatus for automatically translating such classification systems from a source language into 
one or more target languages. 

10 Background of the Invention 

Many companies employ contact centers to exchange information with customers, 
typically as part of their Customer Relationship Management (CRM) programs. Automated 
systems, such as interactive voice response (IVR) systems, are often used to provide customers 
with information in the form of recorded messages and to obtain information from customers 

15 using keypad or voice responses to recorded queries. 

When a customer contacts a company, a classification system is often employed 
to classify spoken utterances or text received from the customer into one of several subject areas 
or classes. In the case of spoken utterances, the classification system will typically first convert 
the speech to text using a speech recognition engine. Once the communication is classified into 

20 a particular subject area, the communication can be routed to an appropriate call center agent or 
response team, as appropriate. For example, a telephone inquiry may be automatically routed to 
a given call center agent based on the expertise, skills or capabilities of the agent. 

A company must employ a different classification system for each language 
supported by the contact center. The process of generating a unique classification system for 

25 each supported language is typically significant, especially where the number of potential 
questions and possible answers associated with each call is large. Thus, a need exists for 
improved techniques for generating a classification system in several desired languages. A 
further need exists for a method and apparatus for automatically translating a classification 
system in a first source language into one or more additional target languages. 

30 
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Summary of the Invention 

Generally, a method and apparatus are provided for automatically translating a 
classification system in a first source language into one or more additional target languages. The 
classification system employs a sample response repository to learn to classify a communication 

5 into one of a plurality of predefined categories. The sample response repository comprises a 
plurality of prior communications each having a classification. The present invention translates 
the sample response repository using a mechanical translation process to generate a translated 
response repository. A natural language understanding process is then applied to the translated 
response repository to generate a natural language understanding module that can classify a 

10 communication in the target language. 

The natural language understanding process can employ statistical methods or a 
rule-base of classification rules that determine how communications are classified. In a rule- 
based implementation, the rule-base is also translated using a mechanical translation process to 
generate a rule-base in the target language. In implementations where spoken words must be 

15 converted to text for processing by the classification system, a speech recognition statistical 
model compilation process is applied to the translated target language response repository to 
generate a speech recognition module in the target language. 

A more complete understanding of the present invention, as well as further 
features and advantages of the present invention, will be obtained by reference to the following 

20 detailed description and drawings. 

Brief Description of the Drawings 

FIG. 1 illustrates a network environment in which the present invention can 

operate; 

25 FIG. 2 is a schematic block diagram of a conventional classification system that 

employs a statistical natural language understanding module; 

FIG. 3 is a schematic block diagram of a multi-lingual classification system 
incorporating features of the present invention; 

FIG. 4 is a flow chart describing an exemplary implementation of a classification 
30 system translation process incorporating features of the present invention; 
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FIG. 5 is a schematic block diagram of a conventional classification system that 
employs a rule-based natural language understanding module; and 

FIG. 6 is a schematic block diagram of a multi-lingual classification system that 
employs a rule-based natural language understanding module according to the present invention. 

5 

Detailed Description 

FIG. 1 illustrates a network environment in which the present invention can 
operate. As shown in FIG. 1, a customer, employing a telephone 1 10 or computing device (not 
shown), contacts a contact center 150, such as a call center operated by a company. The contact 

10 center 150 includes a classification system 200, discussed further below in conjunction with FIG. 
2, that classifies the communication into one of several subject areas or classes 180- A through 
180-N (hereinafter, collectively referred to as classes 180). Each class 180 may be associated, 
for example, with a given call center agent or response team and the communication may then be 
automatically routed to a given call center agent 180 based on the expertise, skills or capabilities 

15 of the agent or team. It is noted that the call center agent or response teams need not be humans. 
In a further variation, the classification system 200 can classify the communication into an 
appropriate subject area or class for subsequent action by another person, group or computer 
process. The network 120 may be embodied as any private or public wired or wireless network, 
including the Public Switched Telephone Network, Private Branch Exchange switch, Internet, or 

20 cellular network, or some combination of the foregoing. 

FIG. 2 is a schematic block diagram of a conventional classification system 200 
that employs a statistical natural language understanding module 250. Generally, the 
classification system 200 classifies spoken utterances or text received from customers into one of 
several subject areas. As shown in FIG. 2, the classification system 200 includes a sample 

25 response repository 210 that stores textual versions of sample responses that have been collected 
from various callers and previously manually classified into one of several subject areas. In one 
implementation, the sample response repository 210 is a domain specific collection of possible 
queries and associated potential answers, such as "How may I help you?" and each of the 
observed answers. 
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In an implementation that must process spoken utterances, the textual version of 
the responses in the sample response repository 210 are optionally processed at stage 220 to 
build a speech recognition module 230, in a known manner. For a detailed discussion of suitable 
techniques for building a speech recognition module 230, see, for example, Nuance Speech 
5 Recognition System, Version 8.0, Grammar Developer's Guide, Chapter 5: Say Anything: 
Statistical Language Models and Robust Interpretation,'' (Nuance Communications, Inc.; 1996- 
2001), incorporated by reference herein. It is noted that in an implementation that does not 
require processing of spoken utterances (i.e., in a text only implementation, for example, that 
processes only email and instant messages), the classification system 200 would not require the 

10 processing at stage 220 or the speech recognition module 230. 

In addition, the textual versions of the responses in the sample response 
repository 210 are automatically processed at stage 240 to create a statistical -based natural 
language understanding module 250. Generally, the classified textual versions of the responses 
210 are processed to look for patterns in the classifications that can subsequently be applied to 

15 classify new utterances. In one class of statistical-based natural language understanding modules 
250, for example, the natural language understanding module 250 generally consists of a root 
word list comprised of a list of root words and a corresponding likelihood (percentage) that the 
root word should be routed to a given destination (e.g., a call center agent 180). In other words, 
for each root word, such as "credit" or "credit card payment," the natural language understanding 

20 module 250 indicates the likelihood (typically on a percentage basis) that the root word should 
be routed to a given destination. 

For a detailed discussion of suitable techniques for call routing and building a 
natural language understanding module 250, see, for example, B. Carpenter and J. Chu-Carroll, 
"Natural Language Call Routing: a Robust, Self-Organizing Approach," Proc. of the Int'l Conf. 

25 on Speech and Language Processing, (1998); J. Chu-Carroll and R. L. Carpenter, "Vector-Based 
Natural Language Call Routing," Computational Linguistics, vol. 25, no. 3, 361-388 (1999); or 
V. Matula, "Using NL to Speech-Enable Advocate and Interaction Center", InAAU 2004, 
Session 624, March 13, 2003., each incorporated by reference herein. 

Generally, to train a statistical language model for a speech recognizer 230, the 

30 sample response repository 210, a corpus of transcriptions of sample utterances, is required. To 
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train a statistical model employed by the natural language understanding module 250 for 
classification requires as input the corpus of transcriptions of utterances 210. Each sample in the 
corpus 210 is "classified" by hand as to the routing destination for the utterance (i.e., if a live 
agent heard this response to a given question, where would the live agent route the call). The 
5 corpus of sample text and classification is analyzed during the training phase to create the 
internal classifier data structures that characterize the utterances and classes. 

As previously indicated, a contact center 150 must provide a different 
classification system 200 for each language supported by the contact center 150. The process of 
generating a unique classification system 200 for each supported language is significant, 

10 especially in domains where the number of potential questions and possible answers associated 
with each call is large (repositories on the order of 20,000 entries are not uncommon). Thus, 
according to one aspect of the invention, a method and apparatus are provided for translating a 
classification system 200 in a first source language into one or more additional target languages. 
In particular, one aspect of the invention uses mechanical translation tools, with their inherent 

15 limitations, to translate the input utterances or text samples into a parallel corpus in the target 
language. 

The present invention recognizes that the statistical processes (speech recognition 
and natural language classification processes) described above in conjunction with FIG. 2, can 
perform well in spite of word ordering errors, additional incorrect vocabularies, and small 

20 percentages of misclassifications. The present invention provides a mechanical, low cost 
mechanism for creating versions of classification applications in new target languages, when the 
underlying classification technology is based on robust statistical methods. While the 
mechanical translation process may not be fully accurate, the translated responses 210 are 
sufficiently accurate to create a natural language understanding module in the target language 

25 with sufficient accuracy. 

In addition, current speech translation tools, while they may generally translate 
words or phrases correctly, may (i) provide more than one possible translation for any given 
phrase, (ii) provide both correctly translated words and phrasing, as well as a number of words 
that while individually correct are not in the same order when used by a native speaker, or (iii) 

30 introduce words that are not correct translations for the input text, given the context of the text 
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(e.g., "What is the floor today?" may mean "What is the material under my feet today?" or it 
may mean "What is the lowest price seen for this item today?"). Nonetheless, the statistical 
approach in the natural language understanding module generally allows such variations to be 
properly classified. 

5 FIG. 3 is a schematic block diagram of a multi-lingual classification system 300 

incorporating features of the present invention. The multi-lingual classification system 300 
translates a classification system in a first source language, such as the classification system 200 
of FIG. 2, into one or more additional target languages. As shown in FIG. 3, the classification 
system 300 employs a mechanical translation process 310 to translate the original source 

10 language corpus (classified textual versions of utterances) 210 into one or more parallel corpuses 
315 in additional target languages. 

Once the original source language corpus 210 is translated into a parallel corpus 
315 in an additional target language, the multi-lingual classification system 300 optionally 
processes the target language classified responses 315 at stage 320 (in an implementation that 

15 must process spoken utterances) to build a speech recognition module 330 in the target language, 
in a known manner. 

In addition, the responses in the target language repository 315 are processed at 
stage 340 to create a natural language understanding module 350 in the target language. 
Generally, the classified textual versions of the responses in the target language repository 315 
20 are processed to look for patterns in the classifications that can subsequently be applied to 
classify new utterances in the target language. 

In one variation, manual examination of special-issue word lists (such as stop lists 
or ignore lists) may be useful in tuning the assistance dictionaries of the mechanical translation 
process 310, and in tuning the target language models. However, this approach should be much 
25 faster and more accurate than mechanical translation methods applied to other recognition or 
classification methods (such as finite state grammar ASR engines or rule-based classifiers). 

The mechanical translation process 310 may be embodied, for example, using the 
translation technologies of SYSTRAN Software Inc., San Diego, CA 92121, USA 
(www.systransoft.com). Among other benefits, the SYSTRAN translation system provides 
30 domain-specific translation libraries. In one variation, the mechanical translation process 310 
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can be updated to include the same idiomatic expressions employed by the speech recognition 
module 330. 

FIG. 4 is a flow chart describing an exemplary implementation of the 
classification system translation process 400 incorporating features of the present invention. 
5 Generally, the classification system translation process 400 provides multi-lingual versions of a 
classification system 200, such as a Customer Relationship Management (CRM) Natural 
Language Call Routing application, once a given language version has been created. 

As shown in FIG. 4, the classification system translation process initially obtains 
the sample response repository 210 in the source language during step 410 and then applies the 
10 mechanical translation process 310 to the sample response repository 210 in the source language 
during step 420. Thereafter, the classification system translation process optionally applies a 
speech recognition statistical model compilation process to the translated target language 
response repository 315 during step 430 to generate the speech recognition module 330, if 
necessary. Finally, the classification system translation process applies natural language 
15 understanding principles to the translated response repository 315 during step 440 to generate a 
statistical model in the target language for the natural language understanding module 350. 

FIG. 5 is a schematic block diagram of a conventional classification system 500 
that employs a rule-based natural language understanding module 550. The conventional 
classification system 500 of FIG. 5 is similar to the conventional classification system 200 of 
20 FIG. 2, other than the generation of the rule-based natural language understanding module 550. 

Thus, as shown in FIG. 5, the classification system 500 includes the same sample 
response repository 210 that was discussed above in conjunction with FIG. 2. In an 
implementation that must process spoken utterances, the textual version of the responses in the 
sample response repository 210 are optionally processed at stage 220 to build a speech 
25 recognition module 230, in the same manner as discussed above. 

In addition, the textual versions of the responses in the sample response 
repository 210 are manually processed at stage 540 to create a rule-based natural language 
understanding module 550. Generally, a person evaluates the textual versions of the responses 
210 and writes rules 545 in a known manner, that are generally based on keywords and indicated 
30 classifications. For example, a person may create a rule that indicates that if an utterance 
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includes the phrase "credit card" and the word "balance," then the communication should be 
assigned to a credit card balance class. 

FIG. 6 is a schematic block diagram of a multi-lingual classification system 600 
that employs a rule-based natural language understanding module according to the present 
5 invention. The multi-lingual classification system 600 translates a classification system in a first 
source language, such as the classification system 500 of FIG. 5, into one or more additional 
target languages. As shown in FIG. 6, the classification system 600 employs a mechanical 
translation process 310 to translate the original source language corpus (classified textual 
versions of utterances) 210 into one or more parallel corpuses 315 in additional target languages, 

10 in a manner described above in conjunction with FIG. 3. 

Once the original source language corpus 210 is translated into a parallel corpus 
315 in an additional target language, the multi-lingual classification system 300 optionally 
processes the target language classified responses 315 at stage 320 (in an implementation that 
must process spoken utterances) to build a speech recognition module 330 in the target language, 

15 in a known manner, described above in conjunction with FIG. 3. 

In addition, according to a further aspect of the present invention, the natural 
language understanding classification rules 545, manually generated in the manner described 
above in conjunction with FIG. 5, are also translated by a mechanical translation process 310 to 
generate natural language understanding classification rules 645 in the target language. 

20 Continuing the above example, the mechanical translation process 310 can translate the source 
language rule that indicates that if an utterance includes the phrase "credit card" and the word 
"balance," then the communication should be assigned to a credit card balance class, into the 
target language. 

As is known in the art, the methods and apparatus discussed herein may be 
25 distributed as an article of manufacture that itself comprises a computer readable medium having 
computer readable code means embodied thereon. The computer readable program code means 
is operable, in conjunction with a computer system, to carry out all or some of the steps to 
perform the methods or create the apparatuses discussed herein. The computer readable medium 
may be a recordable medium (e.g., floppy disks, hard drives, compact disks, or memory cards) or 
30 may be a transmission medium (e.g., a network comprising fiber-optics, the world-wide web, 
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cables, or a wireless channel using time-division multiple access, code-division multiple access, 
or other radio-frequency channel). Any medium known or developed that can store information 
suitable for use with a computer system may be used. The computer-readable code means is any 
mechanism for allowing a computer to read instructions and data, such as magnetic variations on 
5 a magnetic media or height variations on the surface of a compact disk. 

The computer systems and servers described herein each contain a memory that 
will configure associated processors to implement the methods, steps, and functions disclosed 
herein. The memories could be distributed or local and the processors could be distributed or 
singular. The memories could be implemented as an electrical, magnetic or optical memory, or 

10 any combination of these or other types of storage devices. Moreover, the term "memory" should 
be construed broadly enough to encompass any information able to be read from or written to an 
address in the addressable space accessed by an associated processor. With this definition, 
information on a network is still within a memory because the associated processor can retrieve 
the information from the network. 

15 It is to be understood that the embodiments and variations shown and described 

herein are merely illustrative of the principles of this invention and that various modifications 
may be implemented by those skilled in the art without departing from the scope and spirit of the 
invention. 
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